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GAATTCGGAG GAATTATTCA AAACAT AAAG^. AC AA T AAACA ATTTGAGTAG TTGCCGCACA 60 
CACACACA(||epGCC^ , 20 

GCAACAAAAA CATCAATAAA C ATG CAT TGG ATT AAA TGT TTA TTA ACA GCA 171 

Met His Trp lie Lys Cys Leu Leu Thr Ala 
1 5 10 

TTC ATT TGC TTC ACA GTC ATC GTG CAG GTT CAC AGT TCC GGC AGC TTT 219 
Phe He Cys Phe Thr Val He Val Gin Vat His Ser Ser Gly Ser Phe 

15 20 25 

GAG TTG CGC CTG AAG TAC TTC AGC AAC GAT CAC GGG CGG GAC AAC GAG 267 
Glu Leu Arg Leu Lys Tyr Phe Ser Asn Asp His Gly Arq Asp Asn Glu 

30 35 40 

GGT CGC TGC TGC AGC GGG GAG TCG GAC GGA GCG ACG GGC AAG TGC CTG 315 
Gly Arg Cys Cys Ser Gly Glu Ser Asp Gly Ala Thr Gly Lys Cys Leu 

45 50 55 

GGC AGC TGC AAG ACG CGG TTT CGC GTC TGC CTA AAG CAC TAC CAG GCC 363 
Gly Ser Cys Lys Thr Arg Phe Arg Vol Cys Leu Lys His Tyr Gin Ala 
60 65 70 

ACC ATC GAC ACC ACC TCC CAG TGC ACC TAC GGG GAC GTG ATC ACG CCC 411 
Thr lie Asp Thr Thr Ser Gin Cys Thr Tyr Gly Asp Val He Thr Pro 
75 80 85 90 

ATT CTC GGC GAG AAC TCG GTC AAT CTG ACC GAC GCC CAG CGC TTC CAG 459 
He Leu Gly Glu Asn Ser Val Asn Leu Thr Asp Ala Gin Arg Phe Gin 

95 100 105 

AAC AAG GGC TTC ACG AAT CCC ATC CAG TTC CCC TTC TCG TTC TCA TGG 507 
Asn Lys Gly Phe Thr Asn Pro He Gin Phe Pro Phe Ser Phe Ser Trp 

HO 115 120 



FIG. 1 A 



CCG GGT ACC TTC TCG CTG ATC GTC GAG GCC TGG CAT GAT ACG AAC AAT 
Pro Gly Thr Phe Ser Leu He Val Glu Ala Trp His Asp Thr Asn Asn 
125 130 _ 135 

AGC GGC AAT GCG CGA ACC AAC AAG CTC CTC ATC CAG CGA CTC TTG GTG 
Ser Gly Asn Ala Arg Thr Asn Lys Leu Leu He Gin Arq Leu Leu Vol 
140 H5 150 

CAG CAG GTA CTG GAG GTG TCC TCC GAA TGG AAG ACG AAC AAG TCG GAA 
Gin Gin Vol Leu Glu Val Ser Ser Glu Trp Lys Thr Asn Lys Ser Glu 
,55 160 165 170 

TCG CAG TAC ACG TCG CTG GAG TAC GAT TTC CGT GTC ACC TGC GAT CTC 
Ser Gin Tyr Thr Ser Leu Glu Tyr Asp Phe Arg Vol Thr Cys Asp Leu 

175 180 185 

AAC TAC TAC GGA TCC GGC TGT GCC AAG TTC TGC CGG CCC CGC GAC GAT 
Asn Tyr Tyr Gly Ser Gly Cys Ala Lys Phe Cys Arg Pro Arg Asp Asp 

190 195 200 

TCA TTT GGA CAC TCG ACT TGC TCG GAG ACG GGC GAA ATT ATC TGT TTG 
Ser Phe Gly His Ser Thr Cys Ser Glu Thr Gly Glu He He Cys Leu 
205 210 815 

ACC GGA TGG CAG GGC GAT TAC TGT CAC ATA CCC AAA TGC GCC AAA GGC 
Thr Gly Trp Gin Gly Asp Tyr Cys His He Pro Lys Cys Ala Lys Glv 
220 225 230 

TGT GAA CAT GGA CAT TGC GAC AAA CCC AAT CAA TGC GTT TGC CAA CTG 
Cys Glu His Gly His Cys Asp Lys Pro Asn Gin Cys Val Cys Gin Leu 
235 240 245 250 

GGC TGG AAG GGA GCC TTG TGC AAC GAG TGC GTT CTG GAA CCG AAC TGC 
Gly Trp Lys Gly Ala Leu Cys Asn Glu Cys Val Leu Glu Pro Asn Cys 

255 260 265 
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ATC CAT GGC ACC TGC AAC AAA CCC TGG ACT TGC ATC TGC AAC GAG GGT 
He His Gly Thr Cys Asn Lys Pro Trp Thr Cys lie Cys Asn Glu Gly 

270 . 275 . 280 

TGG GGA GGC TTG TAC TGC AAC CAG GAT CTG AAC TAC TGC ACC AAC CAC 
Trp Gly Gly Leu Tyr Cys Asn Gin Asp Leu Asn Tyr Cys Thr Asn His 
885 290 295 

AGA CCC TGC AAG AAT GGC GGA ACC TGC TTC AAC ACC GGC GAG GGA TTG 
Arg Pro Cys Lys Asn Gly Gly Thr Cys Phe Asn Thr Gly Glu Gly Leu 
300 305 310 

TAC ACA TGC AAA TGC GCT CCA GGA TAC AGT GGT GAT GAT TGC GAA AAT 
Tyr Thr Cys Lys Cys Ala Pro Gly Tyr Ser Gly Asp Asp Cys Glu Asn 
3,5 320 325 330 

GAG ATC TAC TCC TGC GAT GCC GAT GTC AAT CCC TGC CAG AAT GGT GGT 
Glu He Tyr Ser Cys Asp Ala Asp Val Asn Pro Cys Gin Asn Gly Gly 

335 340 345 

ACC TGC ATC GAT GAG CCG CAC ACA AAA ACC GGC TAC AAG TGT CAT TGC 
Thr Cys He Asp Glu Pro His Thr Lys Thr Gly Tyr Lys Cvs His Cvs 

350 355 360 

GCC AAC GGC TGG AGC GGA AAG ATG TGC GAG GAG AAA GTG CTC ACG TGT 
Aid Asn Gly Trp Ser Gly Lys Met Cys Glu Glu Lys Val Leu Thr Cys 
365 370 375 

TCG GAC AAA CCC TGT CAT CAG GGA ATC TGC CGC AAC GTT CGT CCT GGC 
Ser Asp Lys Pro Cys His Gin Gly He Cys Arg Asn Val Arq Pro Glv 
380 385 390 

TTG GGA AGC AAG GGT CAG GGC TAC CAG TGC GAA TGT CCC ATT GGC TAC 
Leu Gly Ser Lys Gly Gin Gly Tyr Gin Cys Glu Cys Pro He Gly Tyr 
395 400 405 410 
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1 

AGC GGA CCC AAC TGC GAT- CTC CAG CTG GAC AAC TGC AGT CCG AAT CCA 

Ser Gly Pro Asn Cys Asp Leu Gin Leu Asp Asn Cys Ser Pro Asn Pro 

415 ^ 42-0 • 425 

TGC ATA AAC GGT GGA AGC TGT CAG CCG AGC GGA AAG TGT ATT TGC CCA 
Cys He Asn Gly Gly Ser Cys Gin Pro Ser Gly Lys Cys He Cys Pro 

430 435 440 

GCG GGA TTT TCG GGA ACG AGA TGC GAG ACC AAC ATT GAC GAT TGT CTT 
Ala Gly Phe Ser Gly Thr Arg Cys Glu Thr Asn He Asp Asp Cys Leu 
44 5 450 455 

GGC CAC CAG TGC GAG AAC GGA GGC ACC TGC ATA GAT ATG GTC AAC CAA 
Gly His Gin Cys Glu Asn Gly Gly Thr Cys He Asp Met Val Asn Gin 
460 465 470 

TAT CGC TGC CAA TGC GTT CCC GGT TTC CAT GGC ACC CAC TGT AGT AGC 
Tyr Arg Cys Gin Cys Val Pro Gly Phe His Gly Thr His Cys Ser Ser 
475 480 485 490 

AAA GTT GAC TTG TGC CTC ATC AGA CCG TGT GCC AAT GGA GGA ACC TGC 
Lys Val Asp Leu Cys Leu He Arg Pro Cys Ala Asn Gly Gly Thr Cvs 

495 500 505 

TTG AAT CTC AAC AAC GAT TAC CAG TGC ACC TGT CGT GCG GGA TTT ACT 
Leu Asn Leu Asn Asn Asp Tyr Gin Cys Thr Cys Arg Ala Gly Phe Thr 

510 515 520 

GGC AAG GAT TGC TCT GTG GAC ATC GAT GAG TGC AGC AGT GGA CCC TGT 
Gly Lys Asp Cys Ser Val Asp He Asp Glu Cys Ser Ser Gly Pro Cvs 
525 530 535 

CAT AAC GGC GGC ACT TGC ATG AAC CGC GTC AAT TCG TTC GAA TGC GTG 
His Asn Gly Gly Thr Cys Met Asn Arg Val Asn Ser Phe Glu Cys Val 
540 545 550 
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TGT GCC AAT GOT TTC AGG GGC AAG -CAG TGC GAT GAG GAG TCC TAC GAT 1851 
Cys Ala Asn Gly Phe Arg Gly.Lys Gin Cys Asp Glu Glu Ser Tyr Asp 

555 560 • .. 565. 570 



TCG GTG ACC TTC GAT GCC CAC CAA TAT GGA GCG ACC ACA CAA GCG AGA 1899 
Ser Val Thr Phe Asp Ala His Gin Tyr Gly Ala Thr Thr Gin Ala Arg 

575 580 585 

GCC GAT GGT TTG ACC AAT GCC CAG GTA GTC CTA ATT GCT GTT TTC TCC 1947 
Ala Asp Gly Leu Thr Asn Ala Gin Val Val Leu lie Ala Val Phe Ser 

590 595 600 

GTT GCG ATG CCT TTG GTG GCG GTT ATT GCG GCG TGC GTG GTC TTC TGC 1995 
Val Ala Met Pro Leu Val Ala Val lie Ala Ala Cys Val Val Phe Cys 
605 610 615 

ATG AAG CGC AAG CGT AAG CGT GCT CAG GAA AAG GAC GAC GCG GAG GCC 2043 
Met Lys Arg Lys Arg Lys Arg Ala Gin Glu Lys Asp Asp Ala Glu Ala 
620 625 630 

AGG AAG CAG AAC GAA CAG AAT GCG GTG GCC ACA ATG CAT CAC AAT GGC 2091 
Arg Lys Gin Asn Glu Gin Asn Ala Val Ala Thr Met His His Asn Gly 
635 640 645 650 

AGT GGG GTG GGT GTA GCT TTG GCTTCA GCC TCT CTG GGC GGC AAA ACT 2139 
Ser Gly Val Gly Val Ala Leu Ala Ser Ala Ser Leu Gly Gly Lys Thr 

655 660 665 

i- 

GGC AGC AAC AGC GGT CTC ACC TTC GAT GGC GGC AAC CCG AAT ATC ATC 2187 
Gly Ser Asn Ser Gly Leu Thr Phe Asp Gly Gly Asn Pro Asn He He 

670 675 680 

AAA AAC ACC TGG GAC AAG TCG GTC AAC AAC ATT TGT GCC TCA GCA GCA 2235 
Lys Asn Thr Trp Asp Lys Ser Val Asn Asn He Cys Ala Ser Ala Ala 
685 690 695 
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GCA GCG GCG GCG GCG GCA GCA GCG GCG GAC GAG TGT CTC ATG TAC GGC 2283 
Ala Ala Ala Ala Ala Ala Ala Ala Ala Asp Glu Cys Leu Met Tyr Gly 
700 705 i 710 



GGA TAT GTG GCC TCG GTG GCG GAT AAC AAC^AAf GCC AAC TCA GAC TTT 2331 
Gly Tyr Val Ala Ser Val Ala Asp Asn Asn Asn Ala Asn Ser Asp Phe 
715 720 725 730 

TGT GTG GCT CCG CTA CAA AGA GCC AAG TCG CAA AAG CAA CTC AAC ACC 2379 
Cys Val Ala Pro Leu Gin Arg Ala Lys Ser Gin Lys Gin Leu Asn Thr 

735 740 745 

GAT CCC ACG CTC ATG CAC CGC GGT TCG CCG GCA GGC AGC TCA GCC AAG 2427 
Asp Pro Thr Leu Met His Arg Gly Ser Pro Ala Gly Ser Ser Ala Lys 

750 755 760 

GGA GCG TCT GGC GGA GGA CCG GGA GCG GCG GAG GGC AAG AGG ATC TCT 2475 
Gly Ala Ser Gly Gly Gly Pro Gly Ala Ala Glu Gly Lys Arg He Ser 
765 770 775 

GTT TTA GGC GAG GGT TCC TAC TGT AGC CAG CGTTGG CCC TCG TTG GCG 2523 
Val Leu Gly Glu Gly Ser Tyr Cys Ser Gin Arg Trp Pro Ser Leu Ala 
780 785 790 

GCG GCG GGA GTG GCC GGA GCC TGT TCA TCC CAG CTA ATG GCT GCA GCT 2571 
Ala Ala Gly Val Ala Gly Ala Cys Ser Ser Gin Leu Met Ala Ala Ala 
795 800 805 810 

TCG GCA GCG GGC AGC GGA GCG GGG ACG GCG CAA CAG CAG CGA TCC GTG 2619 
Ser Ala Ala Gly Ser Gly Ala Gly Thr Ala Gin Gin Gin Arg Ser Val 

815 820 825 

GTC TGC GGC ACT CCG CAT ATG TAACTCCAAA AATCCGGAAG GGCTCCTGGT 2670 
Val Cys Gly Thr Pro His Met 

830 

AAATCCGGAG AAATCCGCAT GGAGGAGCTG ACAGCACATA CACAAAGAAA AGACTGGGTT 2730 

GGGTTCAAAA TGTGAGAGAG ACGCCAAAAT GTTGTTGTTG ATTGAAGCAG TTTAGTCGTC 2790 

ACGAAAAATG AAAAATCTGT AACAGGCATA ACTCGTAAAC TCCCTAAAAA ATTTGTATAG 2850 

TAATTAGCAA AGCTGTGACC CAGCCGTTTC GATCCCGAAT TC 2892 
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1 GAATTCCGCT GGGAGAAIGG TCTGAGCTAC CTGCCCGTCC TGCTGGGGCA TCAATGGCAA 

61 GTGGGGAAAG CCACACTGGG CAAACGGGCC, AGGCCAITTC TGGAATGTGG TACATGGTGG 

121 GCAGGGGGCC CGCAACAGCT GGAGGGCAGG TGGACTGAGG CTGGGGATCC CCCGCTGGTT 

181 GGGCAATACT GCCTTTACCC ATGAGCTGGA AAGTCACAAT GGGGGGCAAG GGCTCCCGAG 

241 GGTGGTTATG TGCTTCCTTC AGGTGGC 



FIG.8A 



1 GAATTCCTTC CATTATACGT GACTTTTCTG AAACTGTAGC CACCCTAGTG TCTCTAACTC 

61 CCTCTGGAGT TTGTCAGCTT TGGTCTTTTC AAAGAGCAGG CTCTCTTCAA GCTCCTTAAT 

121 GCGGGCATGC TCCAGTTTGG TCTGCGTCTC AAGATCACCT TTGGTAATTG ATTCTTCTTC 

181 AACCCGGAAC TGAAGGCTGG CTCTCACCCT CTAGGCAGAG CAGGAATTCC GAGGTGGATG 

241 TGTTAGATGT GAATGTCCGT GGCCCAGATG GCTGCACCCC ATTGATGTTG GCTTCTCTCC 

301 GAGGAGGCAG CTCAGATTTG AGTGATGAAG ATGAAGATGC AGAGGACTGT TCTGCTAACA 

361 TCATCACAGA CTTGGTCTAC CAGGGTGCCA GCCTCCAGAC CAGACAGACC GGACTGGTGA 

421 GATGGCCCTG CACCTTGCAG CCCGCTACTC ACGGGCTGAT GCTGCCAAGC GTCTCCTGGA 

481 TGCAGGTGCA GATGCCAATG CCCAGGACAA CATGGGCCGC TGTCCACTCC ATGCTGCAGT 

541 GGCACGTGAT GCCAAGGTGT ATTCAGATCT GTTA 



FIG.8B 



1 TCCAGATTCT GATTCGCAAC CGAGTAACTG ATCTAGATGC CAGGATGAAT GATGGTACTA 

61 CACCCCTGAT CCTGGCTGCC CGCCTGGCTG TGGAGGGAAT GGTGGCAGAA CTGATCAACT 

121 GCCAAGCGGA TGTGAATGCA GTGGATGACC ATGGAAAATC TGCTCTTCAC TGGGCAGCTG 

181 CTGTCAATAA TGTGGAGGCA ACTCTTTTGT TGTTGAAAAA TGGGGCCAAC CGAGACATGC 

241 AGGACAACAA GGAAGAGACA CCTCTGTTTC TTGCTGCCCG GGAGGAGCTA TAAGC 



FIG.8C 



1 GAATTCCATT CAGGAGGAAA GGGTGGGGAG AGAAGCAGGC ACCCACTTTC CCGTGGCTGG 

61 ACTCGTTCCC AGGTGGCTCC ACCGGCAGCT GTGACCGCCG CAGGTGGGGG CGGAGTGCCA 

121 TTCAGAAAAT TCCAGAAAAG CCCTACCCCA ACTCGGACGG CAACGTCACA CCCGTGGGTA 

181 GCAACTGGCA CACAAACAGC CAGCGTGTCT GGGGCACGGG GGGATGGCAC CCCCTGCAGG 

24 1 CAGAGCTG 



FIG.9A 



1 CTAAAGGGAA CAAAAGCNGG AGCTCCACCG CGGGCGGCNC NGCTCTAGAA CTAGTGGANN 

61 NCCCGGGCTG CAGGAATTCC GGCGGACTGG GCTCGGGCTC AGAGCGGCGC TGTGGAAGAG 

121 ATTCTAGACC GGGAGAACAA GCGAATGGCT GACAGCTGGC CTCCAAAGTC ACCAGGCTCA 

181 AATCGCTCGC CCTGGACATC GAGGGATGCA GAGGATCAGA ACCGGTACCT GGATGGCATG 

241 ACTCGGATTT ACAAGCATGA CCAGCCTGCT TACAGGGAGC GTGANNTTTT CACATGCAGT 

301 CGACAGACAC GAGCTCTATG CAT 



FIG.9B 
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G GAG GTG GAT GTG TTA GAT GTG AAT GTC CGT GGC CCA GAT GGC TGC 
Glu Vol Asp Vol Leu Asp Val Asn Vol Arg Gly Pro Asp Gly Cys 

1 5 . .10 / , 15 

ACC CCA TTG ATG TTG GCT TCT CTC CGA GGA GGC AGC TCA GAT TTG AGT 
Thr Pro Leu Met Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser 

?0 '25 50 

GAT GAA GAT GAA GAT GCA GAG GAC TCT TCT GCT AAC ATC ATC ACA GAC 
Asp Glu Asp Glu Asp Ala Glu Asp Ser Ser Ala Asn He He Thr Asp 

35 40 45 

TTG GTC TAC CAG GGT GCC AGC CTC CAG GCC CAG ACA GAC CGG ACT GGT 
Leu Val Tyr Gin Gly Ala Ser Leu Gin Ala Gin Thr Asp Arq Thr Gly 

50 55 60 

GAG ATG GCC CTG CAC CTT GCA GCC CGC TAC TCA CGG GCT GAT GCT GCC 
Glu Met Ala Leu His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala 
65 70 75 

AAG CGT CTC CTG GAT GCA GGT GCA GAT GCC AAT GCC CAG GAC AAC ATG 
Lys Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gin Asp Asn Met 
80 85 90 95 

GGC CGC TGT CCA CTC CAT GCT GCA GTG GCA GCT GAT GCC CAA GGT GTC 
Gly Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gin Gly Val 

100 105 HO 

TTC CAG ATT CTG ATT CGC AAC CGA GTA ACT GAT CTA GAT GCC AGG ATG 
Phe Gin He Leu He Arg Asn Arg Val Thr Asp Leu Asp Ala Arg Met 

U5 120 125 

AAT GAT GGT ACT ACA CCC CTG ATC CTG GCT GCC CGC CTG GCT GTG GAG 
Asn Asp Gly Thr Thr Pro Leu He Leu Ala Ala Arg Leu Ala Val Glu 
130 135 140 



FIG.11A 



GGA ATG GTG GCA GAA CTG ATC AAC TGC CAA GCG GAT GTG AAT GCA GTG 
Gly Met Vat Ala Glu Leu He Asn Cys Gin Ala-Asp Vol Asn Ala Val 
145 150 155 

GAT GAC CAT GGA AAA TCT GCT CTT CAC TGG GCA GCT GCT GTC AAT AAT 
Asp Asp His Gly Lys Ser Ala Leu His Trp Ala Ala Ala Val Asn Asn 
!60 165 170 175 

GTG GAG GCA ACT CTT TTG TTG TTG AAA AAT GGG GCC AAC CGA GAC ATG 
Val Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly Ala Asn Arg Asp Met 

180 185 190 

CAG GAC AAC AAG GAA GAG ACA CCT CTG TTT CTT GCT GCC CGG GAG GGG 
Gin Asp Asn Lys Glu Glu Thr Pro Leu Phe Leu Ala Ala Arq Glu Glv 

195 200 205 

AGC TAT GAA GCA GCC AAG ATC CTG TTA GAC CAT TTT GCC AAT CGA GAC 
Ser Tyr Glu Ala Ala Lys He Leu Leu Asp His Phe Ala Asn Aro Asd 
210 215 220 

ATC ACA GAC CAT ATG GAT CGT CTT CCC CGG GAT GTG GCT CGG GAT CGC 
He Thr Asp His Met Asp Arg Leu Pro Arg Asp Val Ala Arg Asp Arg 
225 230 235 

ATG CAC CAT GAC ATT GTG CGC CTT CTG GAT GAA TAC AAT GTG ACC CCA 
Met His His Asp lie Val Arg Leu Leu Asp Glu Tyr Asn Val Thr Pro 
240 245 250 255 

AGC CCT CCA GGC ACC GTG TTG ACT TCT GCT CTC TCA CCT GTC ATC TGT 
Ser Pro Pro Gly Thr Val Leu Thr Ser Ala Leu Ser Pro Val He Cys 

260 265 270 

GGG CCC AAC AGA TCT TTC CTC AGC CTG AAG CAC ACC CCA ATG GGC AAG 
Gly Pro Asn Arg Ser Phe Leu Ser Leu Lyn His Thr Pro Met Gly Lys 

275 280 285 



FIG.11B 



AAG TCT AGA CGG CCC AGT GCC AAG AGT ACC ATG CCT ACT AGC CTC CCT 910 
Lys Ser Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro 
290 * 295 300 

AAC CTT GCC AAG GAG GCA AAG GAT GCC AAG GGT AGT AGG AGG AAG AAG 958 
Asn Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arq Lys Lys 
305 310 315 

TCT CTG AGT GAG AAG GTC CAA CTG TCT GAG AGT TCA GTA ACT TTA TCC 1006 
Ser Leu Ser Glu Lys Vat Gin Leu Ser Glu Ser Ser Vol Thr Leu Ser 
320 325 330 335 

CCT GTT GAT TCC CTA GAA TCT CCT CAC ACG TAT GTT TCC GAC ACC ACA 1054 
Pro Vol Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp Thr Thr 

340 345 350 

TCC TCT CCA ATG ATT ACA TCC CCT GGG ATC TTA CAG GCC TCA CCC AAC 1102 
Ser Ser Pro Met He Thr Ser Pro Gly He Leu Gin Ala Ser Pro Asn 

355 360 365 

CCT ATG TTG GCC ACT GCC GCC CCT CCT GCC CCA GTC CAT GCC CAG CAT 1150 
Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Vol His Ala Gin His 
370 375 380 

GCA CTA TCT TTT TCT AAC CTT CAT GAA ATG CAG CCT TTG GCA CAT GGG 1198 
Ala Leu Ser Phe Ser Asn Leu His Glu Met Gin Pro Leu Ala His Gly 
385 390 395 

GCC AGC ACT GTG CTT CCC TCA GTG AGC CAG TTG CTA TCC CAC CAC CAC 1246 
Ala Ser Thr Vol Leu Pro Ser Val Ser Gin Leu Leu Ser His His His 
400 405 410 415 

ATT GTG TCT CCA GGC AGT GGC AGT GCT GGA AGC TTG AGT AGG CTC CAT 1294 
lie Val Ser Pro Gly Ser Gly Ser Ala Gly Ser Leu Ser Arg Leu His 

420 425 430 



CCA GTC CCA GTC CCA GCA GAT TGG ATG AAC CGC ATG GAG GTG AAT GAG 1342 
Pro Val Pro Val Pro Ala Asp Trp Met Asn Arg Met Glu Val Asn Glu 

435 440 445 



FIG.11C 



ACC CAG TAC AAT GAG ATG TTT GGT ATG GTC CTG GCT CCA GCT GAG GGC 1390 
Thr Gin Tyr Asn Glu Met Phe Gly Met Vol Leu, Ala Pro Ala Glu Gly 
450 455 460 



ACC CAT CCT GGC ATA GCT CCC CAG AGC AGG CCA CCT GAA GGG AAG CAC 
Thr His Pro Gly He Ala Pro Gin Ser Arg Pro Pro Glu Gly Lys His 
465 470 475 

ATA ACC ACC CCT CGG GAG CCC TTG CCC CCC ATT GTG ACT TTC CAG CTC 
He Thr Thr Pro Arg Glu Pro Leu Pro Pro He Vol Thr Phe Gin Leu 
480 485 490 495 



1438 



1486 



ATC CCT AAA GGC AGT ATT GCC CAA CCA GCG GGG GCT CCC CAG CCT CAG 1534 
He Pro Lys Gly Ser He Ala Gin Pro Ala Gly Ala Pro Gin Pro Gin 

500 505 510 

TCC ACC TGC CCT CCA GCT GTT GCG GGC CCC CTG CCC ACC ATG TAC CAG 1582 
Ser Thr Cys Pro Pro Ala Vol Ala Gly Pro Leu Pro Thr Met Tyr Gin 

515 520 525 

ATT CCA GAA ATG GCC CGT TTG CCC AGT GTG GCT TTC CCC ACT GCC ATG 1630 
He Pro Glu Met Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met 
530 535 540 

ATG CCC CAG CAG GAC GGG CAG GTA GCT CAG ACC ATT CTC CCA GCC TAT 1678 
Met Pro Gin Gin Asp Gly Gin Val Ala Gin Thr He Leu Pro Ala Tyr 
545 550 555 

CAT CCT TTC CCA GCC TCT GTG GGC AAG TAC CCC ACA CCC CCT TCA CAG 1726 
His Pro Phe Pro Ala Ser Vol Gly Lys Tyr Pro Thr Pro Pro Ser Gin 
560 565 . 570 575 



CAC AGT TAT GCT TCC TCA AAT GCT GCT GAG CGA ACA CCC AGT CAC AGT 
His Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr Pro Ser His Ser 

580 585 590 



1774 



GGT CAC CTC CAG GGT GAG CAT CCC TAC CTG ACA CCA TCC CCA GAG TCT 1822 
Gly His Leu Gin Gly Glu His Pro Tyr Leu Thr Pro Ser Pro Glu Ser 

595 600 605 



FIG. 1 1 D 



CCT GAC CAG TGG TCA AGT TCA TCA CCC CAC TCT GCT TCT GAC TGG TCA 1870 
Pro Asp Gin Trp Ser Ser Ser Ser Pro His Ser Ala Ser Asp Trp Ser 
610 615 .620 

GAT GTG ACC ACC AGC CCT ACC CCT GGG GGT GCT GGA GGA GGT CAG CGG 1918 
Asp Val Thr Thr Ser Pro Thr Pro Gly Gly Ala Gly Gly Gly Gin Arg 
625 630 635 

GGA CCT GGG ACA CAC ATG TCT GAG CCA CCA CAC AAC AAC ATG CAG GTT 1966 
Gly Pro Gly Thr His Met Ser Glu Pro Pro His Asn Asn Met Gin Val 
640 645 650 655 

TAT GCG TGAGAGAGTC CACCTCCAGT GTAGAGACAT AACTGACTTT TGTAAATGCT 2022 
Tyr Ala 



GCTGAGGAAC AAATGAAGGT CATCCGGGAG AGAAATGAAG AAATCTCTGG AGCCAGCTTC 2082 

TAGAGGTAGG AAAGAGAAGA TGTTCTTATT CAGATAATGC AAGAGAAGCA ATTCGTCAGT 2142 

TTCACTGGGT ATCTGCAAGG CTTATTGATT ATTCTAATCT AATAAGACAA GTTTGTGGAA 2202 

ATGCAAGATG AATACAAGCC TTGGGTCCAT GTTTACTCTC TTCTATTTGG AGAATAAGAT 2262 

GGATGCTTAT TGAAGCCCAG ACATTCTTGC AGCTTGGACT GCATTTTAAG CCCTGCAGGC 2322 

TTCTGCCATA TCCATGAGAA GATTCTACAC TAGCGTCCTG TTGGGAATTA TGCCCTGGAA 2382 

TTCTGCCTGA ATTGACCTAC GCATCTCCTC CTCCTTGGAC ATTCTTTTGT CTTCATTTGG 2442 

TGCTTTTGGT TTTGCACCTC TCCGTGATTG TAGCCCTACC AGCATGTTAT AGGGCAAGAC 2502 

CTTTGTGCTT TTGATCATTC TGGCCCATGA AAGCAACTTT GGTCTCCTTT CCCCTCCTGT 2562 

CTTCCCGGTA TCCCTTGGAG TCTCACAAGG TTTACTTTGG TATGGTTCTC AGCACAAACC 2622 

TTTCAAGTAT GTTGTTTCTT TGGAAAATGG ACATACTGTA TTGTGTTCTC CTGCATATAT 2682 

» 

CATTCCTGGA GAGAGAAGGG GAGAAGAATA CTTTTCTTCA ACAAATTTTG GGGGCAGGAG V 2742 

ATCCCTTCAA GAGGCTGCAC CTTAAIITTT CTTGTCIGTG TGCAGGICIT CATATAAACT 2802 

FIG.11E 



TTACCAGGAA GAAGGGTGTG AGTTTGTTGT TTTTCTGTGT ATGGGCCTGG TCAGTGTAAA 2862 

GTTTTATCCT TGATAGTCTA GTTACTATGA CCCTCCCCAC TTTTTTAAAA CCAGAAAAAG 2922 

GTTTGGAATG TTGGAATGAC CAAGAGACAA GTTAACTCGT GCAAGAGCCA GTTACCCACC 2982 

CACAGGTCCC CCTACTTCCT GCCAAGCATT CCATTGACTG CCTGTATGGA ACACATTTGT 3042 

CCCAGATCTG AGCATTCTAG GCCTGTTTCA CTCACTCACC CAGCATATGA AACTAGTCTT 3102 

AACTGTTGAG CCTTTCCTTT CATATCCACA GAAGACACTG TCTCAAATGT TGTACCCTTG 3162 

CCATTTAGGA CTGAACTTTC CTTAGCCCAA GGGACCCAGT GACAGTTGTC TTCCGTTTGT 3222 

CAGATGATCA GTCTCTACTG ATTATCTTGC TGCTTAAAGG CCTGCTCACC AATCTTTCTT 3282 

TCACACCGTG TGGTCCGTGT TACTGGTATA CCCAGTATGT TCTCACTGAA GACATGGACT 3342 

TTATATGTTC AAGTGCAGGA ATTGGAAAGT TGGACTTGTT TTCTATGATC CAAAACAGCC 3402 

CTATAAGAAG GTTGGAAAAG GAGGAACTAT ATAGCAGCCT TTGCTATTTT CTGCTACCAT 3462 

TTCTTTTCCT CTGAAGCGGC CATGACATTC CGTTTGGCAA CTAACGTAGA AACTCAACAG 3522 



FIG. 1 1 F 
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AACATTTTCC TTTCCTAGAG TCACCTTTTA GATGATAATG 6ACAACTATA GACTTGCTCA 3582 
TTGTTCAGAC TGATTGCCCC TCACCTGAAT CCACTCTCTG TATTCATGCT CTTGGCAATT 3642 
TCTTTGACTT TCTTTTAAGG GCAGAAGCAT TTTAGTTAAT TGTAGATAAA GAATAGTTTT 3702 
CTTCCTCTTC TCCTTGGGCC AGTTAATAAT TGGTCCATGG CTACACTGCA ACTTCCGTCC 3762 
AGTGCTGTGA TGCCCATGAC ACCTGCAAAA TAAGTTCTGC CTGGGCATTT TGTAGATATT 3822 

AACAGGTGAA TTCCCGACTC TTTTGGTTTG AATGACAGTT CTCATTCCTT CTATGGCTGC 3882 

AAGTATGCAT CAGTGCTTCC CACTTACCTG ATTTGTCTGT CGGTGGCCCC ATATGGAAAC 3942 

CCTGCGTGTC TGTTGGCATA ATAGTTTACA AATGGTTTTT TCAGTCCTAT CCAAATTTAT 4002 

TGAACCAACA AAAATAATTA CTTCTGCCCT GAGATAAGCA GATTAAGTTT GTTCATTCTC 4062 

TGCTTTATTC TCTCCATGTG GCAACATTCT GTCAGCCTCT TTCATAGTGT GCAAACATTT 4122 

TATCATTCTA AATGGTGACT CTCTGCCCTT GGACCCATTT ATTATTCACA GATGGGGAGA 4182 

ACCTATCTGC ATGGACCCTC ACCATCCTCT GIGCAGCACA CACAGTGCAG GGAGCCAGTG 4242 

GCGATGGCGA TGACTTTCU CCCCTG A or 0 
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Potential signal cleavage site 
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MQSQRSRRRS RAPNTWICFW I NKMHAVASL PASLPLLLLT LAFANLPNIV RGTDTALVa|a 

MLGKATCRCA SGFTGEDCQY STSHPCFVSR PCLNGGTCHM LSRDT-YECT CQVGFTGKEC 
GVADYACSCA LGFSGPLCLT PLDNAC-LTN PCRNGGTCDL LT-LTEYKCR CPPGWSGKSC 
NAIDFICHCP VGFTDKVCLT PVDNAC-VNN PCRNGGTCEL LNSVTEYKCR CPPGWTGDSC 
GRPGISCKCP LGFDESLCEI AVPNAC-DHV TCLNGGTCQL KT-LEEYTCA CANGYTGERC 



NLPGSYQCQC PQGFTGQYCD SLYVPCAPSP CVNGGTCRQT GDFTFECNCL PGFEGSTCER 

NEVGSYRCVC RATHTGPNCE RPYVPCSPSP CQNGGTCRPT GDVTHECACL PGFTGQNCEE 

NEFGSYRCTC QNRFTGRNCD EPYVPCNPSP CLNGGTCRQT DDTSYDCTCL PGFSGQNCEE 

NTHGSYQCMC PTGYTGKDCD TKYNPCSPSP CQNAGICRSN G-LSYECKCP KGFEGKNCEQ 
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EGF-I ike Repeats 



QCRDGYEPCV NEGMCVTYHN GTGYCKCPEG FLGEYCQHRD PCE-KNRCQN GGTC— VAQA 
RCSQPGETCL NGGKCEA-AN GTEACVCGGA FVGPRCQDPN PCL-STPCKN AGTCHWDRR 
RCTQTAEMCL NGGRCEMTPG GTGVCLCGNL YFGERCQFPN PCTIKNQCMN FGTCEPVLQG 
SCTSVG— CQ NGGTCVTQLN GKTYCACDSH YVGDYCEHRN PCN-SMRCQN GGTCQVTFRN 



QWTDACLSHP CANGSTCTTV — ANQFSCKC LTGFTGQKCE TDVNEC-DIP GHCQHGGTCL 
QQADPCASNP CANGGQCLPF — EASYICHC PPSFHGPTCR QDVNECGQKP RLCRHGGTCH 
QQADPCASNP CANGGKCLPF — EIQYICKC PPGFHGATCK QDINEC-S-Q NPCKNGGQCI 
ETKNLCASSP CRNGATCTAL AGSSSFTCSC PPGFTGDTCS YDIEEC-Q-S NPCKYGGICV 

N I DDCPNHRC QNGGVCVDGV NTYNCRCPPQ WTGQFCTEDV DECLLQPNA- CQNGGTCANR 
N I DDCPGNNC KNGGACVDGV NTYNCPCPPE WTGQYCTEDV DECQLMPNA- CQNGGTCHNT 
NIDDCPSNNC RNGGTCVDGV NTYNCQCPPD WTGQYCTEDV DECQLMPNA- CQNGGTCHNT 
NYDDCLGHLC QNGGTCIDGI SDYTCRCPPN FTGRFCQDDV DECAQRDHPV CQNGATCTNT 
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NGGYGCVCVN GWSGDDCSEN I DDCAFASCT PGSTCIDRVA SFSCMCPEGK AGLLCHLDDA 

HGGYNCVCVN GWTGEDCSEN I DDCASAACF HGATCHDRVA SFYCECPHGR TGLLCHLNDA 

YGGYNCVCVN GWTGEDCSEN I DDCANAACH SGATGHDRVA SFYCECPHGR TGLLCHLDNA 

HGSYSCICVN GWAGLDCSNN TDDCKQAACF YGATC I DGVG SFYCQCTKGK TGLLCHLDDA 



AFHCECLKGY AGPRCEMDIN ECHSDPCQND ATCLDKIGGF TCLCMPGFKG VHCELEINEC 
SFECQCLQGY TGPRCEIDVN ECVSNPCQND ATCLDQIGEF QCMCMPGYEG VHCEVNTDEC 
SFQCNCPQGY AGPRCEIDVN ECLSNPCQND STCLDQIGEF QCICMPGYEG LYCETNIDEC 
SYRCNCSQGF TGPRCETNIN ECESHPCQNE GSCLDDPGTF RCVCMPGFTG TQCE I D I DEC 



ATGFTGVLCE ENIDNCDPDP CHHGQCQDG I DSYTCICNPG YMGAICSDQI DECYSSPCLN 

TEGYTGTHCE VDIDECDPDP CHYGSCKDGV ATFTCLCRPG YTGHHCETNI NECSSQPCRL 

TEGFTGRHCE QDINECIPDP CHYGTCKDG I ATFTCLCRPG YTGRLCDND I NECLSKPCLN 

PPGYTGTSCE I N I NDCDSNP CHRGKC I DDV NSFKCLCDPG YTGYICQKQI NECESNPCQF 
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CISNPCHKGA LCDTNPLNGQ YICTCPQGYK GADCTEDVDE CAMANSNPCE HAGKCVNTDG 

CISNPCNEGS NCDTNPVNGK AICTCPSGYT GPACSQDVDE CSLG-ANPCE HAGKCINTLG 

CISNPCNEGS NCDTNPVNGK AICTCPPGYT GPACNNDVDE CSLG-ANPCE HGGRCTNTLG 

CTSNPCHADA ICDTSPINGS YACSCATGYK GVDCSEDIDE CDQG— SPCE HNGICVNTPG 



QSNPCVNNGQ CVDKVNRFQC LCPPGFTGPV CQIDIDDCSS TPCLNGAKCI DHPNGYECQC 

ASSPCLHNGR CLDKINEFQC ECPTGFTGHL CQYDVDECAS TPCKNGAKCL DGPNTYTCVC 

ASNPCLHNGK CIDKINEFRC DCPTGFSGNL CQHDFDECTS TPCKNGAKCL DGPNSYTCQC 

QSNPCLNDGT CHDKINGFKC SCALGFTGAR CQINIDDCQS QPCRNRGICH DSIAGYSCEC 



DGRCIDLVNG YQCNCQPGTS GVNCE I NFDD CASNPCIHG- ICMDGINRYS CVCSPGFTGQ 
RGTCQDPDNA YLCFCLKGTT GPNCE I NLDD CASSPCDSG- TCLDKIDGYE CACEPGYTGS 
GGQCTDRENG YICTCPKGTT GVNCETKIDD CASNLCDNG- KCIDKIDGYE CTCEPGYTGK 
DGHCQDRVGS YYCQCQAGTS GKNCEVNVNE CHSNPCNNGA TCIDGINSYK CQCVPGFTGQ 
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RCNIDIDECA SNPCRKGATC INGVNGFRCI . CPEGPHHPSC YSQVNECLSN PCI-HGNCTG 
MCNSN I DECA GNPCHNGGTC EDG I NGFTCR CPEGYHDPTC LSEVNECNSN PCV-HGACRD 
LCNININECD SNPCRNGGTC KDQINGFTCV CPDGYHDHMC LSEVNECNSN PCI-HGACHD 
HCEKNVDEC I SSPCANNGVC IDQVNGYKCE CPRGFYDAHC LSDVDECASN PCVNEGRCED 



DECASNPCLN QGTCFDDISG YTCHCVLPYT GKNCQTVLAP CSPNPCENAA VCKESPNFES 

NECASNPCLN KGTCIDDVAG YKCNCLLPYT GATCEWLAP CAPSPCRNGG ECRQSEDYES 

NECSSNPCLN HGTCIDDVAG YKCNCMLPYT GAICEAVLAP CAGSPCKNGG RCKESEDFET 

DDCVTNPCGN GGTCIDKVNG YKCVCKVPFT GRDCESKMDP CASNRCKNEA KCTPSSNFLD 



CLANPCQNGG SCMDGVNTFS CLCLPGFTGD KCQTDNMECL SEPCKNGGTC SDYVNSYTCK 
CRPNPCHNGG SCTDGINTAF CDCLPGFRGT FCEEDINECA SDPCRNGANC TDCVDSYTCT 
CQPNPCHNGG SCSDG I NMFF CNCPAGFRGP KCEEDINECA SNPCKNGANC TDCVNSYTCT 
CASFPCQNGG TCLDGIGDYS CLCVDGFDGK HCETDINECL SQPCQNGATC SQYVNSYTCT 
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GLSGYKCLCD AGWVGINCEV DKNECLSNPC QNGGTCDNLV NGYRCTCKKG FKGYNCQVNI 
SLNGYKCDCD PGWSGTNCDI NNNECESNPC VNGGTCKDMT SGIVCTCREG FSGPNCQTNI 
GVNGYKCDCE AGWSGSNCDI NNNECESNPC MNGGTCKDMT GAYICTCKAG FSGPNCQTNI 
GINEFICHCP PGYTGKRCEL DIDECSSNPC QHGGTCYDKL NAFSCQCMPG YTGQKCETNI 



YTCLCA-PGW QGQRCTIDID EC- I SKPCMN HGLCHNTQGS YMCECPPGFS GMDCEEDIDD 
FSCVCPTAGA KGQTCEVDIN EC-VLSPCRH GASCQNTHGG YRCHCQAGYS GRNCETDIDD 
FSCECP-^PGW QGQTCE I DMN EC-VNRPCRN GATCQNTNGS YKCNCKPGYT GRNCEMD I DD 
FSCTCK-LGY TGRYCDEDID ECSLSSPCRN GASCLNVPGS YRCLCTKGYE GRDCAINTDD 



CQAGFDGVHC ENNINECTES SCFNGGTCVD GINSFSCLCP VGFTGSFCLH EINECSSHPC 
CPAGFSGIHC ENNTPDCTES SCFNGGTCVD GINSFTCLCP PGFTGSYCQH WNECDSRPC 
CQPGFSGIHC ESNTPDCTES SCFNGGTCID GINTFTCQCP PGFTGSYCQH DINECDSKPC 
CPLGFSGINC QTNDEDCTES SCLNGGSCID G I NGYNCSCL AGYSGANCQY KLNKCDSNPC 
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LNEGTCVDGL GTYRCSCPLG YTGKNCQTLV NLCSRSPCKN KGTCVQKKAE SQCLCPSGWA 
LLGGTCQDGR GLHRCTCPQG YTGPNCQNLV HWCDSSPGKN ;GGKCWQTHTQ YRCECPSGWT 
LNGGTCQDSY GTYKCTCPQG YTGLNCQNLV RWCDSSPCKN GGKCWQTNNF YRCECKSGWT 
LNGATCHEQN NEYTCHCPSG FTGKQCSEYV DWCGQSPCEN GATCSQMKHQ FSCKCSAGWT 



SNPCQHGATC SDFIGGYRCE CVPGYQGVNC EYEVDECQNQ PCQNGGTCID LVNHFKCSCP 

PSPCQNGATC TDYLGGYSCK CVAGYHGVNC SEEIDECLSH PCQNGGTCLD LPNTYKCSCP 

PNPCQNGATC TDYLGGYSCE CVAGYHGVNC SEEINECLSH PCQNGGTCID LINTYKCSCP 

SQPCQNGGTC RDLIGAYECQ CRQGFQGQNC ELNIDDCAPN PCQNGGTCHD RVMNFSCSCP 



CLSNPCSSEG SLDCIQLTND YLCVCRSAFT GRHCETFVDV CPQMPCLNGG TCAVASNMPD 
CLSNPCDARG TQNCVQRVND FHCECRAGHT GRRCESVING CKGKPCKNGG TCAVASNTAR 
CLSNPCDSRG TQNCIQLVND YRCECRQGFT GRRCESWDG CKGMPCRNGG TCAVASNTER 
CLSNPCSNAG TLDCVQLVNN YHCNCRPGHM GRHCEHKVDF CAQSPCQNGG NCNI — RQS 
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f GAYCDVPNVS CDIAASRRGV LVEHLCQHSG VC I NAGNTHY CQCPLGYTGS YCEEQLDECA 
GLYCDVPSVS CEVAAQRQGV DVARLCQHGG LCVDAGNTHH CRCQAGYTGS YCEDLVDECS 
GVYCDVPSVS CEVAAKQQGV DIVHLCRNSG MCVDTGNTHF CRCQAGYTGS YCEEQVDECS 
GKLCDVQT I S CQDAADRKGL SLRQLC-NNG TCKDYGNSHV CYCSQGYAGS YCQKE I DECQ 
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£PHCLN GGQCMDRIGG YSCRCLPGFA GERCEGDINE 



PGTRGLLCEE NIDDCAR 

RGTQGVHCE I NVDDCNPPVD PVSRSPKCFN NGTCVDQVGG YSCTCPPGFV GERCEGDVNE 
RGTQGVHCE I NVDDCTPFYD SFTLEPKCFN NGKCIDRVGG YNCICPPGFV GERCEGDVNE 
PGTMGIICEI NKDDCKP GACHN NGSCIDRVGG FECVCQPGFV GARCEGDINE 



GFICRCPPGF SGARCQS — SCGQVKCRKG EQCVHTAS— GPRCFCPSP RDCES — 

GFICKCPAGF EGATCENDAR TCGSLRCLNG GTCISGPR- SPTCLCLGPF TGPECQFPAS 
GFICKCPPGF DGATCEYDSR TCSNLRCQNG GTCISVLT— SSKCVCSEGY TGATCQYPVI 
v GHHCICNNGF YGKNCELSGQ DCDSNPCRVG -NCWADEGF GYRCECPRGT LGEHCEIDTL 
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-GC-ASSPCQ HGGSCHPQRQ PPYYSCQCAP PFSGSRCELh .-YTAPP S TPP 

SPCLGGNPCY NQGTCEPTSE SPRYRCLCPA KFNGLLCHI 
SPC-ASHPCY NGGTCQFFAE EPFFQCFCPK NFNGLFCHI 

DEC-SPNPCA QGAACEDLLG D— YECLCPS KWKGKRCDIIY DANYPGWNGG SGSGNDRYAA 



- -YTAPP- 
L DYSFGG- 
L DYEFPG- 



-GAGRDIPPP 
-GLGKNITPP 



NN-QCDELCN TVECLFDNFE CQGNSKTCK- -YDKYCADHF KDNHCNQGCN SEECGWDGLD 
SDGHCDSQCN SAGCLFDGFD CQRAEGQCNP LYDQYCKDHF SDGHCDQGCN SAECEWDGLD 
NDGKCDSQCN NTGCLYDGFD CQKVEVQCNP LYDQYCKDHF QDGHCDQGCN NAECEWDGLD 
KNGKCNEECN NAACHYDGHD CERKLKSCDS LFDAYCQKHY GDGFCDYGCN NAECSWDGLD 



YYGEKSAAMK KQ— R- 



IrtTRRSL PGEQ E QEVAGSKVFL 



YYGREEELRK HP I KRAAEGW AAPDALLGQV KASLLPGGSE GGRRRRELOP MDVRGSIVYL 

YYGNEEELKK HHIKRSTDYW SDAPSAI FSTMKES I L LGRHRRELDE MEVRGSIVYL 

WKDNVRVPEI EDTDFARKNK I LYTQQVHQ TGIQIYL 
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LNR (Notch/Lin-12 Repeots) 
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-A — TCL SQYCADKARD GVCDEACNSH ACQWDGGDCS LTMENPWANC SSPLPCWDYI 
LI EE — ACE LPECQEDAGN KVCSLQCNNH ACGWDGGDCS LNFNDPWKNC TQSLQCWKYF 
DNDD — ICE NEQCSELADN KVCNANCNNH ACGWDGGDCS LNFNDPWKNC TQSLQCWKYF 
DLEQQRAV|CD KRGCTEKQGN GICDSDCNTY ACNFDGNDCS LGI-NPWANC TAN-EXWNKF 
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CAADQPEN-L AEGTLVIWL MPPEQLLQDA R-SFLRALGT LLHTNLRIKR DSQGELMVYP 
CAEHVPER-L AAGTL-WW LMPPEQLRNS SFHFLRELSR VLHTNWFKR DAHGQQMIFP 
C-ANMPEN-L AEGTLVLWL MPPERLKNNS V-NFLRELSR VLHTNWFKK DSKGEYKIYP 
CEjNKTQSPVL AEGAMSWML MNVEAFREIQ A-QFLRNMSH MLRTTVRLKK DALGHDI I IN 

TM 



E I DNRQCVQD SDHCFKNTDA AAALLASHAI QG — TLSYP LVSWSESLT PERT-Q-.LY 
EIDNRQCVQA SSQCFQSATD VAAFLGALAS LGSL-N I PYK IEAVQSETVE PPPPAQ-.HF 
E I DNRQCYKS SSQCFNSATD VAAFLGALAS LGSLDTLSYK I EAVKSENME TPKPST-.YP 
^EIDNRKCTEC FTHAVEAAEF LAATAAKHQL RNDFQ-IHSV RG I KNPGDED NGEPPAN^KY 
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LLAVAWIIL FIILLGVIWX KRKRK — HGS LWLPEGFTLR RDASNHKRRE PVGQDAVGLK 



MYVAAAAFVL LFFVGCGVL 
MLSMLVIPLL I IFVFWMVI 
VITGIILVII ALAFFGMVL 



|_ SRKRRRQHGQ LWFPEGFKV- SEASKKKRRE ELGEDSVGLK 
NKKRRREHDS FGSPTALFQK" NPA-KRNGET PW-EDSVGLK 
STQRKRAHGV "TWFPEGFRAP AAVMSRRRRD PHGQEMRNLN 



CDC-10/Ankyrin Repeats 

PIDRRPWTQQ HLEAAD1RRT PSLALTPPQA EQEVDVLDVN VRGPDGCTPL MLASLRGGSS 

QTDHRQWTQQ HLDAADL-RM SAMAPTPPQG EVDADCMDVN VRGPDGFTPL MIASCSGGGL > 

KTDPRQWTRQ HLDAADL-R 1 SSMAPTPPQG EIEADCMDVN VRGPDGFTPL MIASCSGGGL 

EADQRVWSQA HLDWDV-R- AIM— TPP-A HQDGGKHD V.D ARGPCGLTPL MIAAVRGGGL 

ANAQDNMGRC PLHAAVAADA QGVFQ I L I RN RVTDLDARMN DGTTPLILAA RLAVEGMVAE 
ANIQDNMGRT PLHAAVSADA QGVFQ I L I RN RATDLDARMH DGTTPLILAA RLAVEGMLED 
ANVQDNMGRT PLHAAVAADA QGVFQIL IRN RATDLDARMF DGTTPLILAA RLAVEGMVEE 
ANCQDNTGRT PLHAAVAADA MGVFQILLRN RATNLNARMH DGTTPLILAA RLA1EGMVED y 



r NLSVQVSEAN L IGTGTSEHW VDDE- 
PLK-NASDGA LMDDNQNE-W GDED- 
P I K-NMTDGS FMDDNQNE-W GDEET- 
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<5 PQPKKVKAED EALLSE-EDD 

- LETKKFRFEE PWLPD-LDD 

- LENKRFRFEE QVILPELVDD 



KQVAMQSQGV GQPGAH — VI SDDESDMPLP KRQRSDPVSG VGLGNNGGYA SDHTMVSEYE 



DLSDEDEDAE DSSAN 1 1 TOL VYQGASLQAQ TDRTGEMALH LAARYSRADA AKRLLDAGAD 
ETGNSEEE-E DAPA-VISDF IYCCASLHNQ TDRTGETALH LAARYSRSDA AKRLLEASAD 
ETGNSEEE-E DASANMISDF IGQGAQLHNQ TDRTGETALH LAARYARADA AKRLLESSAD 
DTGEDIENNE DSTAQVISDL LAQGAELNAT MDKTGETSLH LAARFARADA AKRLLDAGAD 



L I NCQADVNA VDDHGKSALH WAAAVNNVEA TLLLLKNGAN RDMQDNKEET PLFLAAREGS 
L I NSHADVNA VDDLGKSALH WAAAVNNVDA AWLLKNGAN KDMQNNREET PLFLAAREGS 
L I NAHADVNA VDEFGKSALH WAAAVNNVDA AAVLLKNSAN KDMQNNKEET SLFLAAREGS 
LITADADINA ADNSGKTALH WAAAVNNTEA VNILLMHHAN RDAQDDKDET PLFLAAREGS 
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YEAAKILLDH FANRD I TDHM DRLPRDVARD RMHHDIVRLL DEYNVTPSPP — GTVL— TS 

YETAKVLLDH FANRD I TDHM DRLPRDIAQE RMHHDIVRLL DEYNLVRSPQ LHGAPLGGTP 

YETAKVLLDH YANRD I TDHM DRLPRDIAQE RMHHDIVHLL DEYNLVKSPT LHNGPLGAT- 

YEACKALLDN FANRE I TDHM DRLPRDVASE RLHHDIVRLL DE-HVPRSPQ MLSMTPQAMI 



NLS CK II cdc2 

GSlRRKRSLSE KVQLSE— SS VTLMsLE 
A-RRKKSQDG KGCLLD— SS GMLSPVDSLE 
A-IRRKnSQDG KTTLLDSGSS GVLSPVDSLE 



cdc2 



sphtyvsdtt 
sphgylsdva 
sjhgJylsdvs 




GS-PDNGLDA TGSURRKIASS KKTSAASKKA ANLNGLNPGQ LTGGVSGVPG VPPTNSAAQA 
BNTS 



YEyC I KNAQS MQSLQGNGLD MIKLDNYAYS MGgPF 
CK II cdc2 



MTSPF 



ITSPGILQAS PNPML— ATA APPAPVHAQH 
LPSPFHOClS PSVPLNHLPG MPDTHLGIGH 

PSMPLNHLTS MPESQLGMNH 
LLNGQGLGMN GNGQRNGVGP 
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GSPPPGQQQP QLITQPTVIS AGNGGNNGNG NASGKQSNQT AKQKAA- 
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GSAGSLSRLH PVPVPADW— MNRMEVNETQ YNEMFGMVLA . PAEG-THPG I APQSRPPEGK 
GQCEWLSRLQ SGMVPNQYNP LRGSVAPGPL STQAPSLQHG -MVGPLHSSL AASALSQMMS 
SQCDWLARLQ NGMVQNQYDP- IRNGIQQGN- AQQAQALQHG LMTS-LHNGL PATTLSQMMT 
PSLPTSPTHI QAMRHATQQK QFGGSNLNSL LGGANGGGW GGGGGGGGGV GQGPQNSPVS 

APQPQSTCPP AVAGPLPTMY QIP -EM ARL-PS VAF P TAMMPQQDGQ VAQT I LPAYH 

PPQPHLGVSS AASGHLGRSF LSGEPSQADV QPLGPSSLAV HTILPQ-ESP ALPTSLPSSL 
MQQQHHN-SS TTSTHINSPF CSSDISQTDL QQM— SSNN I HSVMPQ-OTQ I FAASLPSNL 
QQQLGGLEFG SAGLDLNG-F CGSPDSFHSG QMNPPS — I QSSMSG-SSP STNMLSPSSQ 



SDWSDVTTSP TPGGAGGGQR GPGTHMSEPPHNN MQVYA 

SDWSEGVSSP PT SMQ SQIARIPEAFK 

SDWSEGISSP PT SMQ PQRTHIPEAFK 



SDWSEGVQSP AA| NNLYiSGG HQANKGSEA I Y I 
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— HITTPRE PLPP-IV-TF QLIPKGSIAQ PAG 

-YQGLPSTRL ATQPHLVQTQ QVQPQNLQMQ QQNLQPANIQ QQQSLQPPPP 
-YQAMPNTRL ANQPHLMQAQ QMQQQQN LQLHQS 



LGIISPTGSD MGIMLAPPQS SKNSAIMQTI SPQQQQQQQQ QQQQQHQQQQ Q 
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TPPSQHSY-S S-PVENTPSH QLQVP-EGPF LTPSPESPDQ WSSSSPHSNV 
TPPSQHSY-S S-PMDNTPSH QLQVP-DHPF LTPSPESPDQ WSSSSPHSNM 
TPSSQHS GGHTPQH LVQTL-D-SY PTPSPESPGH WSSSSPRSN- 



2433 
2530 
2497 
2671 

2471 
2556 
2523 
2703 



FIG.13H 



5/ 




FIG. 14 




5^ 



4 





• • • ♦ ♦ • * ♦ , 

CGAGATGGCT ATGAACCCTG TGTAAATCAA GGAATGTGTG TTACCTACCA CAATGGCACA GGATACTGCA AATGTCCAGA AGGCTTCTTG 
R 0 G Y E P C V N E CMC V T Y H N G T G Y C K C P E G F L> 



NO 200 210 220 230 240 250 260 270 
♦♦♦»♦»»♦, 
GGGGAATATT GTCAACATCG AGACCCCTGT GAGAAGAACC GCTGCCAGAA TGGTGGGACT TGTGTGGCCC AGGCCATGCT GGGGAAAGCC 
GEY CQHR DPC EKN RCON GGT CVA QAM I GKA> 
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♦ . • • » * « » « t 
ACGTGCCGAT GTGCCTCAGG GTTTACAGGA GAGGACTGCC AGTACTCAAC ATCTCATCCA TGCTTTGTGT CTCGACCCTG CCTGAATGGC 
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TGTGAGACTG ATGTCAATGA GTGTGACATT CCAGGACACT GCCAGCATGG TGGCACCTGC CTCAACCTGC CTGGTTCCTA CCAGTGCCAG 
CET OVNE CDI PGH CQHG GTC LNL PCS Y Q C Q> 
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TGCCCTCAGG GCTTCACAGG CCAGTACTGT GACAGCCTGT ATGTGCCCTG TGCACCCTCA CCTTGTGTCA ATGGAGGCAC CTGTGGGCAG 
cp 0 GFTG QYC DSL YVPC APS PCV NGGT CRQ> 
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CQN GGVC VDG VNT YNCR CPP Q W I GQFC TED> 
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V D E CLLQ PNA CON GGTC ANR NGG YGCV CVN> 
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* * * ♦ ♦ » ♦ ♦ . 
GAGAACATTG ACAACTGTGA CCCCGATCCT TGCCACCATG GTCAGTGTCA GGATGGTATT GATTCCTACA CCTGCATCTG CAATCCCGGG 

E N I D N C D P D P C H H G Q C Q D G I D S Y T C I C N P G> 

1810 1820 1830 1840 1850 1860 1870 1880 1890 
********* 
TACATGGGCG CCATCTGCAC TCACCAGATT GATGAATGTT ACAGCAGCCC TTGCCTGAAC GATGGTCGCT GCATTGACCT GGTCAATGCC 
Y M G A I C S 0 Q I 0 E C Y S S P C L N 0 G R C I D L V N G> 

1900 1910 1920 1930 1940 1950 1960 1970 1980 

* * * ♦ » * ♦ * * 

TACCAGTGCA ACTGCCAGCC AGGCACGTCA GGGGTTAATT GTGAAATTAA TTTTGATCAC TGTGCAAGTA ACCCTTGTAT CCATGGAATC 
YQC NCOP GTS GVN CEIN FDD CAS NPCI HGI> 

1990 2000 2010 2020 2030 2040 2050 2060 2070 

* * » ♦ * ♦ ♦ , , 
TGTATGGATG GCATTAATCG CTACAGnGT GTCTGCTCAC CAGGATTCAC AGGGCAGAGA TGTAACATTG ACATTGATGA GTGTGCCTCC 
C M 0 G I N R Y S C V C S P G F T G Q R C N I D I D E C A S> 

2080 2090 2100 2110 2120 2130 2140 2150 2160 

* * ♦ ♦ ♦ * * « , 
AATCCCTGTC GCAAGGGTGC AACATGTATC AACGGTGTGA ATGGTTTCCG CTGTATATGC CCCGAGGGAC CCCATCACCC CAGCTGCTAC 

NPC RKCA TCI NGV NGFR CIC PEG PHHP SCY> 

2170 2180 2190 2200 2210 2220 2230 2240 2250 

* •**««♦♦ » 
TCACAGGTGA ACGAATGCCT GAGCAATCCC TGCATCCATG GAAACTGTAC TGGAGGTCTC AGTGGATATA AGTGTCTCTG TGATGCAGGC 
S Q V N E C L SUP C I H G N C T G G L S G Y K C L C 0 A G> 

2260 2270 2280 2290 2300 2310 2320 2330 2340 
♦♦♦*•»»,» 
TGGGTTGGCA TCAACTGTGA AGTGGACAAA AATGAATGCC TTTCGAATCC ATGCCAGAAT GGAGGAACTT GTGACAATCT GGTGAATGGA 
W V G I N C E V 0 K NEC L S N P C Q N G G T C D N L V N G> 

2350 2360 2370 2380 2390 2400 2410 2420 2430 

* ♦ ♦ * * » » » , 
TACAGGTGTA CTTGCAAGAA GGGCTTTAAA GGCTATAACT GCCAGGTGAA TATTGATGAA TGTGCCTCAA ATCCATGCCT GAACCAAGGA 
YRC TCKF GFK GYN CQVN IDE CAS NPCL NOG) 
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2440 2450 2460 2470 2480 2490 2500 2510 2520 

♦ ♦ » * ♦ » » » ♦ 

ACCTGCTTTG ATGACATAAG TGGCTACACT TGCCACTGTG TGCTGCCATA CACAGGCAAG AATTGTCAGA CAGTATTGGC TCCCTGTTCC 
T C F DDIS G Y T C H C VLPY.TGKNCQ T V I A P C S> 

2530 2540 2550 2560 2570 2580 2590 2600 2610 
*♦»*•»♦♦♦ 
CCAAACCCTT GTGAGAATGC TGCTGTTTGC AAAGAGTCAC CAAATTTTGA GAGTTATACT TGCTTGTGTG CTCCTGGCTG GCAAGCTCAG 
P N P C E N A A V C K E S P N F E S Y T C L C A P G tf Q G Q> 

2620 2630 2640 2650 2660 2670 2680 2690 2700 

♦ * • * ♦ » ♦ » » 
CGGTGTACCA TTGACATTGA CGAGTGTATC TCCAAGCCCT GCATGAACCA TGGTCTCTGC CATAACACCC AGGGCAGCTA CATGTGTGAA 

R C T I D I 0 E C I S K P C M N H G L C H N T Q G S Y M C E> 

2710 2720 2730 2740 2750 2760 2770 2780 2790 
********* 
TGTCCACCAG GCTTCAGTGG TATGGACTGT GAGGAGGACA TTGATGACTG CCTTGCCAAT CCTTGCCAGA ATGGAGGTTC CTGTATGGAT 
C P P G F S G M 0 C E E D I 0 0 C LAN P C 0 N G G S C M D> 

2800 2810 2820 2830 2840 2850 2860 2870 2880 

♦ ♦ ♦ ♦ * ♦ ♦ » t 
GGAGTGAATA CTTTCTCCTG CCTCTGCCTT CCGGGTTTCA CTGGGGATM GTGCCAGACA GACATGAATG AGTGTCTGAG TGAACCCTGT 

G V N T F S C L C L P G F T G 0 K C Q T DUN E C L S E P C> 

2890 2900 2910 2920 2930 2940 2950 2960 2970 
♦♦**♦♦»»» 
AAGAATGGAG GGACCTGCTC TGACTACGTC AACAGTTACA CTTGCAAGTG CCAGGCAGGA TTTGATGGAG TCCATTGTGA GAACAACATC 
KNG GTCS OYV NSY TCKC QAG FOG VHCE NNI> 

2980 2990 3000 3010 3020 3030 3040 3050 3060 

♦ ♦ ♦ * ♦ * * * « 
AATGAGTGCA CTGAGAGCTC GTGTTTCAAT GGTGGCACAT GTGTTGATGG GATTAACTCC TTCTCTTGCT TGTGCCCTGT GGGTTTCACT 

NEC TESS CFN GGT CVOG INS FSC LCPV GFT> 

3070 3080 3090 3100 3110 3120 3130 3140 3150 
**♦•♦♦»«♦ 
GGATCCTTCT GCCTCCATGA GATCAATGAA TGCAGCTCTC ATCCATGCCT GAATGAGGGA ACGTGTGTTG ATGGCCTGGG TACCTACCGC 
CSF CLHE INE CSS HPCL NEC TCV DGLG TYR> 

3160 3170 3180 3190 3200 3210 3220 3230 3240 
******** t 
TGCAGCTGCC CCCTGGGCTA CACTGGGAAA AACTGTCAGA CCCTGGTGAA TCTCTGCAGT CGGTCTCCAT GTAAAAACAA AGGTACTTGT 
CSC PLGY TGK NCQ TLVN ICS RSP CKNK CIO 
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3250 3260 3270 ,3280 3290 3300.: 3310 3320 3330 

♦ » ♦ ' * , , , 

GITCAGAAAA AAGCAGAGTC CCAGTGCCTA TGTCCATCTG GATGGGCTGG TGCCTATTGT GACGTGCCCA ATGTCTCTTG TGACATAGCA 
V 0 K K A E S Q C L C P S G W A C A Y C 0 V P N V S C 0 I A> 

3340 3350 3360 3370 3380 3390 3400 3410 3420 

♦ ♦ ♦ ♦ » » » » , 

GCCTCCACGA GAGGTGTGCT TGTTGAACAC TTGTGCCAGC ACTCAGGTGT CTGCATCAAT CCTGGCAACA CGCATTACTG TCAGTGCCCC 
A S R R C V L V E H ICQ H S G V C I N A G N T H Y C Q C P> 

3430 3440 3450 3460 3470 3480 3490 3500 3510 
*♦♦♦♦♦*♦♦ 
CTGGGCTATA CTGGGAGCTA CTGTGAGGAG CAACTCGATG AGTGTGCGTC CAACCCCTGC CAGCACGGGG CAACATGCAG TGACTTCATT 
L C Y T G S Y C E E OLD E C A S N P C 0 H G A T C S 0 F |> 

3520 3530 3540 3550 3560 3570 3580 3590 3600 
•*♦«♦♦♦»♦ 
GGTGGATACA GATGCGAGTG TGTCCCAGGC TATCAGGGTG TCAACTGTGA GTATGAAGTG GATGAGTGCC AGMTCAGCC CTGCCAGAAT 
G G Y R C E C V P G Y Q G V N C E Y E V DEC 0 N Q P CON) 

3610 3620 3630 3640 3650 3660 3670 3680 3690 

* ♦ ♦ ♦ ♦ « ♦ . ♦ 
GGAGGCACCT GTATTGACCT TGTGAACCAT TTCAAGTGCT CTTGCCCACC AGGCACTCGG GGCCTACTCT GTGAAGAGAA CATTGATGAC 

G c T C I D L V N H F K C S C P P G T R G L L C E E N I 0 D> 

3700 3710 3720 3730 3740 3750 3760 3770 3780 

♦ ♦ ♦ ♦ « . , , , 
TGTGCCCGGG GTCCCCATTG CCTTAATGGT GGTCAGTGCA TGGATAGGAT TGGAGGCTAC AGTTGTCGCT GCTTGCCTGG CTTTGCTGGG 
CAR CPHC LNG GQC MORI CGY SCR CLPG FAG> 

3790 3800 3810 3820 3830 3840 3850 3860 3870 
*♦♦»♦♦♦♦» 
GAGCGTTGTG AGGGAGACAT CAACGAGTGC CTCTCCAACC CCTGCAGCTC TGAGGGCAGC CTGGACTGTA TACAGCTCAC CAATGACTAC 
E R C E G D I NEC L S N P C S S E G S L D C I Q L T N D Y> 



3880 3890 3900 3910 3920 3930 3940 3950 



3960 



CTGTGTGTTT GCCGTAGTGC CTTTACTGGC CGGCACTGTG AAACCTTCGT CGATGTGTGT CCCCAGATGC CCTGCCTGAA TGGAGGGACT 
L C V C R S A FTC R H C E T F V D V C P Q M P C L N G G T> 

3970 3980 3990 4000 4010 4020 4030 4040 4050 
* * * * » ♦ ♦ ♦ ♦ 

TGTGCTGTGG CCAGTAACAT GCCTGATGGT TTCATTTGCC GTTGTCCCCC GGGATTTTCC GGGGCAAGGT GCCAGAGCAG CTGTGGACAA 
C A V AS N M P D G F I C R C P P G F S GAR C Q S S C G Q> 

FIG.17E 




4060 4070 4080 4090 4100 4110 4120 4130 



4140 



GTGAAATGTA GGAAGGGGGA GCAGTGTGTG CACACCGCCT CTGGACCCCG CTGCTTCTGC CCCAGTCCCC GGGACTGCGA GTCAGGCTGT 
V K C R K G E Q C V H T A S G P R C F C P S P R 0 C E SCO 

4150 4160 4170 4180 4190 4200 4210 4220 4230 

* * » » * ♦ ♦ . , 

GCCAGTAGCC CCTGCCAGCA CGGGGGCAGC TGCCACCCTC AGCGCCAGCC TCCTTATTAC TCCTGCCACT GTGCCCCACC ATTCTCGGGT 
ASS P C Q H G G S C H P Q R Q P P Y Y SCO. C A P P F S G> 

4240 4250 4260 4270 4280 4290 4300 4310 4320 

♦ ♦ • » » « » » , 
AGCCGCTGTG AACTCTACAC GGCACCCCCC AGCACCCCTC CTGCCACCTG TCTCACCCAG TATTGTGCCG ACAAAGCTCG GGATGGCGTC 

SRC E L Y T A P P S T P P A T C L S 0 Y C A 0 K A R 0 G V> 



4330 4340 4350 4360 4370 4380 4390 4400 



4410 



TGTGATGAGG CCTGCAACAG CCATGCCTGC CAGTGGGATG GGGGTGACTG TTCTCTCACC ATGGAGAACC CCTGGGCCAA CTGCTCCTCC 
C D E A C N S H A C 0 W 0 G G D C S L T MEN P W A N C S S> 



4420 4430 4440 4450 4460 4470 4480 4490 



4500 



CCACTTCCCT GCTGGGATTA TATCAACAAC CAGTGTGATG AGCTGTGCAA CACGGTCGAG TGCCTGTTTG ACAACTTTGA ATGCCAGGGG 
P L P C W D Y INN Q C D E L C N T V E C IF 0 N F E COO 



4510 4520 4530 4540 4550 4560 4570 4580 



4590 



AACAGCAAGA CATGCAAGTA TGACAAATAC TGTGCAGACC ACTTCAAAGA CAACCACTGT AACCAGGGGT GCAACAGTGA GGAGTGTGGT 
N S K T C K Y D K Y CAD H F K 0 N H C NOG C N S E ECO 



4600 4610 4620 4630 4640 4650 4660 4670 



4680 



TGGGATGGGC TGGACTGTGC TGCTGACCAA CCTGAGAACC TGGCAGAAGG TACCCTGGTT ATTGTGGTAT TGATGCCACC TGAACAACTG 
W0G L0CA ADQ PEN LAEG TLV IVV LMPP EQL> 

4690 4700 4710 4720 4730 4740 4750 4760 4770 

»♦»♦.. 
CTCCAGGATG CTCGCAGCTT CTTGCGGGCA CTGGGTACCC TGCTCCACAC CAACCTGCGC ATTAAGCGGG ACTCCCAGGG GGAACTCATG 
LQO ARSFLRA LGT LLHT NLR IKR OSQG ELM> 

4780 4790 4800 4810 4820 4830 4840 4850 4860 

* * * ♦ » * * ♦ 

GTGTACCCCT ATTATGGTGA GAAGTCAGCT GCTATGAAGA AACAGAGGAT GACACCCAGA TCCCTTCCTC GTGAACAAGA ACAGGAGGTG 
VYP YYGE KSA A II K KQRM TRR SLP GEQE QEV> 
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4870 4880 4890 4900 4910 4920 4930 4940 



4950 



* 



GCTGGCTCTA AAGTCTTTCT GGAAATTGAC AACCGCCAGT GTGTTCAAGA CTCAGACCAC TGCTTCAAGA ACACGGATGC AGCAGCAGCT 
AGS K V F L E I D N R Q C V Q 0 S 0 H C F K N T 0 A A A A> 

4960 4970 4980 4990 5000 5010 5020 5030 5040 
**♦*•♦♦*♦ 

CTCCTGGCCT CTCACGCCAT ACAGGGGACC CTGTCATACC CTCTTGTGTC TGTCGTCAGT GAATCCCTGA CTCCAGAACG CACTCAGCTC 
LLA SHAI QGT LSY PLVS VVS ESL T P E R T Q L> 

5050 5060 5070 5080 5090 5100 5110 5120 5130 
* • * « ♦ « ♦ ♦ , 

CTCTATCTCC TTGCTGTTGC TCTTGTCATC ATTCTGTTTA TTATTCTGCT GGGGGTAATC ATGGCAAAAC GAAAGCGTAA GCATGGCTCT 
L Y 1 LAVA V V I I L F I I L L G V I M A K R K R X H G S> 



5140 5150 5160 5170 5180 5190 5200 5210 



5220 



CTCTGGCIGC CTGAAGGTTT CACTCTTCGC CGAGATGCAA GCAATCACAA GCGTCGTGAG CCAGTGGGAC AGGATGCTGT GGGGCTGAAA 
L w L P E G F T L R R D A S N H K R R E P V G 0 0 A V G L K> 

5230 5240 5250 5260 5270 5280 5290 5300 5310 
1 • * « » * » » , 

AATCTCTCAG TGCAAGTCTC AGAAGCTAAC CTAATTGGTA CTGGAACAAG TGAACACTGG GTCGATGATG AAGGGCCCCA GCCAAAGAAA 
N L S V Q V S E A N L I G T G T S E H W V D D E G P Q P K K> 



5320 5330 5340 5350 5360 5370 5380 5390 



5400 



GTAAAGGCTG AAGATGAGGC CTTACTCTCA GAAGAAGATG ACCCCATTGA TCGACGGCCA TGGACACAGC AGCACCTTGA AGCTGCAGAC 
V K A E D E A L L S E E 0 0 P I 0 R R P W T Q Q H I E A A 0> 



5410 5420 5430 5440 5450 5460 5470 5480 



5490 



ATCCGTAGGA CACCATCGCT GGCTCTCACC CCTCCTCAGG CAGAGCAGGA GGTGGATGTG TTAGATGTGA ATGTCCGTGG CCCAGATGGC 
IRR TPSL ALT PPQ AEQE VOV LDV NVRG PDG> 



5500 5510 5520 5530 5540 5550 5560 5570 



5580 



TGCACCCCAT TGATGTTGGC TTCTCTCCGA GGAGGCAGCT CAGATTTGAG TGATGAAGAT GAAGATGCAG AGGACTCTTC TGCTAACATC 
C T P L M L A SLR G G S S D L S D E D EDA E D S S A N |> 



5590 5600 5610 5620 5630 5640 5650 5660 



5670 



ATCACAGACT TGGTCTACCA GGGTGCCAGC CTCCAGGCCC AGACAGACCG GACTGGTGAG ATGGCCCTGC ACCTTGCAGC CCGCTACTCA 
I T 0 L V Y Q GAS L Q A Q T 0 R T G E M A L H L A A R Y SX 
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5680 5690 5700 5710 5720 5730 5740 5750 5760 

CGCGCTGATG CTGCCAAGCG TCTCCTGGAT GCAGGTGCAG ATGCCAATGC CCAGGACAAC ATGGGCCGCT GTCCACTCCA TGCTGCAGTG 
RAO A A K R L L D A C A 0 A N A ... jQ D N , M G R C P L H A A V> 



5770 5780 5790 5800 5810 5820 5830 5840 5850 
* * ♦ » ♦ ♦ ♦ » ♦ 

GCAGCTGATG CCCAAGGTGT CTTCCAGATT CTGATTCGCA ACCGAGTAAC TGATCTAGAT GCCAGGATGA ATGATGGTAC TACACCCGTG 
A A 0 A Q G V F Q I L I R N R V T OLD ARM N 0 G T T P L> 

5860 5870 5880 5890 5900 5910 5920 5930 5940 
**♦*♦«».♦ 
ATCCTGGCTG CCOGCCTGGC TGTGGAGGGA ATGGTGGCAG AACTGATCAA CTGCCAAGCG GATGTGAATC CAGTCGATGA CCATGGAAAA 
I L A A R L A V E G M V A [ L I N C 0 A 0 V N A V 0 0 H G K> 



5950 5960 5970 5980 5990 6000 6010 6020 



6030 



TCIGCTCTTC ACTGGGCAGC TGCTGTCAAT AATGTGGAGG CAACTCTTTT GTTGTTGAAA AATGGGGCCA ACCGAGACAT CCAGGACAAC 
SAL H W A A A V N N V E A T L L L L K N G A N R 0 M Q 0 N> 



6040 6050 6060 6070 6080 6090 6100 6110 



6120 



AAGGAAGAGA CACCTCTGTT TCTTGCTGCC CGGGAGGGGA GCTATGAAGC AGCCAAGATC CTGTTAGACC ATTTTGCCAA TCGAGACATC 
KEE TPLF LAA REG SYEA AKI LLO HFAN RDI> 



6130 6140 6150 6160 6170 6180 6190 6200 



6210 



ACAGACCAIA TGGATCGTCT TCCCCGGGAT GTGGCTCGGG ATCGCATGCA CCATGACATT GTGCGCCTTC TGGATGAATA CAATGTGACC 
TOH MORL PRO V A R DRMH H D I VRL LOEY NVT> 



6220 6230 6240 6250 6260 6270 6280 6290 



6300 



CCAAGCCCTC CAGGCACCGT GTTGACTTCT GCTCTCTCAC CTGTCATCTG TGGGCCCAAC AGATCTTTCC TCAGCCTGAA GCACACCCCA 
P s p P G T V LIS A L S P V I C G P N R S F L S L K H T P> 



6310 6320 6340 6350 6360 6370 6380 6390 



6400 



ATGGGCAAGA AGTCTAGACG CCCCAGTGCC AAGAGTACCA TGCCTACTAG CCTCCCTAAC CTTGCCAAGG AGGCAAAGGA TGCCAAGGGT 
M C K K S R R PSA K S T M P T S L P N L A K E A K D A K G> 

6400 6410 6420 6430 6440 6450 6460 6470 6480 

• • ♦ « ♦ , , , 

AGTAGGAGGA AGAAGTCTCT GAGTGAGAAG GTCCAACTGT CTGAGAGTTC AGTAACTTTA TCCCCTGTTG ATTCCCTAGA ATCTCCTCAC 
S R R K K S L S E K VOL S E S S V T L S P V D S L E S P H> 
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6490 6500 6510 6520 6530 6540 6550 6560 6570 

• * » • .; : ••• • » ••• * ♦ , 

AOGTATGTTT CCGACACCAC ATCCTCTCCA ATGATTACAT CCCCTGGGAT CTTACAGGCC TCACCCAACC CTATGTTGGC CACTGCCGCC 
T Y V S D T T S S P MIT S P G I L Q A S P N P M L A T A A> 



6580 6590 6600 6610 6620 6630 6640 6650 6660 

• ■ • ♦ * ♦ . , ♦ , 

CCTCCTGCCC CAGTCCATGC CCAGCATGCA CTATCTTTTT CTAACCTTCA TGAAATGCAG CCTTTGGCAC ATGGGGCCAG CACTGTGCTT 

P P A P V H A Q H A L S F S N L H E M Q P L A H G A S T V L> 

6670 6680 6690 6700 6710 6720 6730 6740 6750 

* ♦ ♦ » * * * ♦ ♦ 
CCCTCAGTCA GCCAGTTGCT ATCCCACCAC CACATTGTGT CTCCAGGCAG TGGCAGTGCT GGAAGCTTGA GTAGCCTCCA TCCAGTCCCA 

P S V S Q L L S H H HIV S P G S G S A G S L S R L H P V P> 



6760 6770 6780 6790 6800 6810 6820 6830 



6840 



GTCCCAGCAG ATTGGATGAA CCGCATGGAG GTGAATGAGA CCCAGTACAA TGAGATGTTT GGTATGGTCC TGGCTCCAGC TGAGGGCACC 
V P A D W M N R M E V N E T 0 Y N E M F G M V L A P A E G T> 



6850 6860 6870 6880 6890 6900 6910 6920 



6930 



CATCCTGGCA TAGCTCCCCA GAGCAGGCCA CCTGAAGGGA AGCACATAAC CACCCCTCGG GAGCCCTTGC CCCCCATTGT GACTTTCCAG 
HPG IAPQ SRP PEG KHIT TPR EPL P P I V TFQ> 

6940 6950 6960 6970 6980 6990 7000 7010 7020 

CTCATCCCTA AAGGCAGTAT TGCCCAACCA GOGGGGGCTC CCCAGCCTCA GTCCACCTGC CCTCCAGCTG TTGCGGGCCC CCTGCCCACC 
L,p KGSJ AQP AGA PQPQ STC PPA VAGP LPT> 



7030 7040 7050 7060 7070 7080 7090 7100 



7110 



ATGTACCAGA TTCCAGAAAT GGCCCGTTTG CCCAGTGTGG CTTTCCCCAC TGCCATGATG CCCCAGCAGG ACGGGCAGGT AGCTCAGACC 
My 0 IPEM ARL PSV AFPT AMM PQQ OGQV AQT> 

7,20 71 ^0 7140 7150 7160 7170 7180 7190 7200 
* • ♦ » * ♦ ♦ ♦ ♦ 

ATTCTCCCAG CCTATCATCC TTTCCCAGCC TCTGTGGGCA AGTACCCCAC ACCCCCTTCA CAGCACAGTT ATGCTTCCTC AAATGCTGCT 
I L P A Y H P F P A SVG K Y P T P P S Q H S Y A S S N A A> 



7210 7220 7230 7240 7250 7260 7270 7280 



7290 



GAGCGAACAC CCAGTCACAG TGGTCACCTC CAGGGTGAGC ATCCCTACCT GACACCATCC CCAGAGTCTC CTGACCAGTG GTCAAGTTCA 
ERI PSHS GHL QGE HPYL TPS PES POOW SSS> 



FIG. 1 71 



7300 7310 7320 7330 7340 7350 7360 7370 7380 

♦ * ♦ • ♦ v ♦ .* * ♦ * * 

TCACCCCACT CTGCTTCTGA CTGGTCAGAT GTGACCACCA GCCGTACCCC TGGGGGTGCT GGAGGAGGTC AGCGGGGACC TGGGACACAC 
SPH SASO WSO VTT SPT P -G G A GGG QRGP GTH> 

7390 7400 7410 7420 7430 7440 7450 7460 7470 
********* 

ATGTCTGAGC CACCACACAA CAACATGCAG GTTTATGCGT GAGAGAGTCC ACCTCCAGTG TAGAGACATA ACTGACTTTT GTAAATGCTG 
MSE PPHN NMQ VYA> 

7480 7490 7500 7510 7520 7530 7540 7550 7560 
********* 

CIGAGGAACA AATGAAGGTC ATCCGGGAGA GAAATGAAGA AATCTCTGGA GCCAGCTTCT AGAGGTAGGA AAGAGAAGAT GTTCTTATTC 

7570 7580 7590 7600 7610 7620 7630 7640 7650 
********* 
AGATAATGCA AGAGAAGCAA TTOGTCAGTT TCACTGGGTA TCTGCAAGGC TTATTGATTA TTCTAATCTA ATAAGACAAG TTTGTGGAAA 

7660 7670 7680 7690 7700 7710 7720 7730 7740 
» * * * * « * * # 

TGCAAGATGA ATACAAGCCT TGGGTCCATG TTTACTCTCT TCTATTTGGA GAATAAGATG GATGCTTATT GAAGCCCAGA CATTCTTGCA 

7750 7760 7770 7780 7790 7800 7810 7820 7830 

♦ * * * * * * * * 
GCTTGGACTG CATTTTAAGC CCTGCAGGCT TCTGCCATAT CCATGAGAAG ATTCTACACT AGCGTCCTGT TGGGAATTAT GCCCTGGAAT 

7840 7850 7860 7870 7880 7890 7900 7910 7920 

♦ ♦♦»*♦»*♦ 
TCTGCCTGAA TTGACCTACG CATCTCCTCC TCCTTGGACA TTCTTTTGTC TTCATTTGGT GCTTTTGGTT TTGCACCTCT CCGTGATTGT 

7930 7940 7950 7960 7970 7980 7990 8000 8010 
*♦**»**♦« 

ACCCCTACCA GCATGTTATA GGGCAAGACC TTTGTGCTTT TGATCATTCT CGCCCATCAA AGCAACTTTG GTCTCCTTTC CCCTCCTGTC 

8020 8030 8040 8050 8060 8070 8080 8090 8100 
**♦»♦♦♦*♦ 
TTCCCGGTAT CCCTTGGAGT CTCACAAGGT TTACTTTGGT ATGGTTCTCA GCACAAACCT TTCAAGTATG TTGTTTCTTT GGAAAATGGA 

8110 8120 8130 8140 8150 8160 8170 8180 8190 

♦ **♦»♦♦♦♦ 

CATACTGTAT TGTGTTCTCC TGCATATATC ATTCCTGGAG AGAGAAGGGG AGAAGAATAC TTTTCTTCAA CAAATTTTGG GGGCAGGAGA 

8200 8210 8220 8230 8240 8250 8260 8270 8280 
♦♦***♦♦♦* 
TCCCTTCAAG AGGCTGCACC TTAATTTTTC TTGTCTGTGT GCAGGTCTTC ATATAAACTT TACCAGGAAG AAGGGTGTGA CTTTGTTGTT 
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8290 8300 8310 8320 8330 8340 8350 8360 8370 

ITTCTGTGTA TGGGCCTGGT CAGTGTAAAG TTTTATCCTT GATAGTCTAG TTACTATGAC CCTCCCCACT TTTTTAAAAC CAGAAAAAGG 

8380 8390 8400 8410 8420 ' 8430 8440 8450 8460 
•**♦*»»«♦ 

TTTGGAATGT TGGAATGACC AAGAGACAAG TTAACTOGTG CAAGAGCCAG TTACCCACCC ACAGGTCCCC CTACTTCCTG CCAAGCATTC 

8470 8480 8490 8500 8510 8520 8530 8540 8550 
**♦*»»»*♦ 
CATTGACTGC CTGTATGGAA CACATTTGTC CCAGATCTGA GCATTCTAGG CCTGTTTCAC TCACTCACCC AGCATATGAA ACTAGTCTTA 

8560 8570 8580 8590 8600 8610 8620 8630 8640 

* * ♦ * » * * ♦ , 

ACTGTTGAGC CTTTCCTTTC ATATCCACAG AAGACACTGT CTCAAATGTT GTACCCTTGC CATTTAGGAC TGAACTTTCC TTAGCCCAAG 

t 

8650 8660 8670 8680 8690 8700 8710 8720 8730 

♦ * ♦ ♦ ♦ ♦ • ■ • ♦ 
GGACCCAGTG ACAGTTGTCT TCCGTTTGTC AGATGATCAG TCTCTACTGA TTATCTTCCT GCTTAAAGGC CTGCTCACCA ATCTTTCTTT 

8740 8750 8760 8770 8780 8790 8800 8810 8820 
♦*♦**«»», 
CACACCGTGT GGTCCGTGTT ACTGGTATAC CCAGTATGTT CTCACTGAAG ACATGGACTT TATATGTTCA AGTGCAGGAA TTGGAAAGTT 

8830 8840 8850 8860 8870 8880 8890 8900 8910 

♦ * « ♦ ♦ ♦ ♦ . , 
GGACTTGTTT TCTATGATCC AAAACAGCCC TATAAGAAGG TTGGAAAAGG AGGAACTATA TAGCAGCCTT TGCTATTTTC TGCTACCATT 

8920 8930 8940 8950 8960 8970 8980 8990 9000 
♦♦♦♦»♦♦», 
TCTTTTCCTC TGAAGCGGCC ATGACATTCC CTTTGGCAAC TAACGTAGAA ACTCAACAGA ACATTTTCCT TTCCTAGAGT CACCTTTTAG 

9010 9020 9030 9040 9050 9060 9070 9080 9090 

• * « « ♦ ♦ ♦ * , 
ATGATAATCG ACAACTATAG ACTTGCTCAT TGTTCAGACT GATTGCCCCT CACCTGAATC CACTCTCTGT ATTCATGCTC TTGGCAATTT 

9100 9110 9120 9130 9140 9150 9160 9170 9180 

* ♦ * ♦ » * » , , 
CTTTGACTTT CTTTTAAGGG CAGAAGCATT TTAGTTAATT GTAGATAAAG AATAGTTTTC TTCCTCTTCT CCTTGGGCCA GTTAATAATT 

9190 9200 9210 9220 9230 9240 9250 9260 9270 
♦♦♦♦** ♦ ♦ ♦ 

GGTCCATGGC TACACTCCAA CTTCCGTCCA GTGCTGTGAT GCCCATGACA CCTGCAAAAT AAGTTCTGCC TGGGCATTTT fiTAfiATATTA 
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9280 9290 9300 9310 9320 9330 9340 9350 9360 

♦ »»«««*»» 
. ACAGGTGAAT TCCOGACTCT TTTGGTTTGA ATGACAGTTC TCATTCCTTC TATGGCTGCA AGTATGCATC AGTGCTTCCC ACTTACCIGA 

9370 9380 9390 9400 9410 9420 9430 9440 9450 
*♦*«»«»»» 
TTTGTCTGTC GGTGGCCCCA TATGGAAACC CTGCGTGTCT GTTGGCATAA TAGTTIACAA ATGGTTTTTT CAGTCCTATC CAAATTTATT 

9460 9470 9480 9490 9500 9510 9520 9530 9540 

* » * * * * » * * 
GAACCAACAA AAATAATTAC TTCTGCCCTG AGATAAGCAG ATTAAGTTTG TTCATTCTCT GCTTTATTCT CTCCATGTGG CAACATTCTG 

9550 9560 9570 9580 9590 9600 9610 9620 9630 
********* 
TCAGCCTCTT TCATAGTGTG CAAACATTTT ATCATTCTAA ATGGTGACTC TCTGCCCTTG GACCCATTTA TTATTCACAG ATGGGGAGAA 

9640 9650 9660 9670 9680 9690 9700 9710 9720 
********* 
CCTATCTGCA TGGACCCTCA CCATCCTCTG TGCAGCACAC ACAGTGCAGG GAGCCAGTGG CGATGGCGAT GACTTTCTTC CCCTGGGAAT 



TCC 

FIG.17L 



